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Abstract — In distributed storage systems that employ era- 
sure coding, the issue of minimizing the total repair bandwidth 
required to exactly regenerate a storage node after a failure 
arises. This repair bandwidth depends on the structure of 
the storage code and the repair strategies used to restore the 
lost data. Minimizing it requires that undesired data during a 
repair align in the smallest possible spaces, using the concept 
of interference alignment (IA). Here, a points-on-a-lattice 
representation of the symbol extension IA of Cadambe et al. 
provides cues to perfect IA instances which we combine with 
fundamental properties of Hadamard matrices to construct 
a new storage code with favorable repair properties. Specifi- 
cally, we build an explicit (k + 2,k) storage code over QF(3), 
whose single systematic node failures can be repaired with 
bandwidth that matches exactly the theoretical minimum. 
Moreover, the repair of single parity node failures generates 
at most the same repair bandwidth as any systematic node 
failure. Our code can tolerate any single node failure and any 
pair of failures that involves at most one systematic failure. 

I. Introduction 

The demand for large scale data storage has increased 
significantly in recent years with applications demanding 
seamless storage, access, and security for massive amounts 
of data. When the deployed nodes of a storage network 
are individually unreliable, as is the case in modern 
data centers, or peer-to-peer networks, redundancy through 
erasure coding can be introduced to offer reliability against 
node failures. However, increased reliability does not come 
for free: the encoded representation needs to be main- 
tained posterior to node erasures. To maintain the same 
redundancy when a storage node leaves the system, a new 
node has to join the array, access some existing nodes, and 
regenerate the contents of the departed node. This problem 
is known as the Code Repair Problem (3), (TJ. 

The interest in the code repair problem, and specifically 
in designing repair optimal (n, k) erasure codes, stems 
from the fact that there exists a fundamental minimum 
repair bandwidth needed to regenerate a lost node that 
is substantially less than the size of the encoded data 
object. MDS erasure storage codes have generated par- 
ticular interest since they offer maximum reliability for a 
given storage capacity; such an example is the EvenOdd 
construction (2). However, most practical solutions for 



storage use existing off-the-shelf erasure codes that are 
repair inefficient: a single node repair generates network 
traffic equal to the size of the entire stored information. 

Designing repair optimal MDS codes, i.e., ones achiev- 
ing the minimum repair bandwidth bound that was derived 
in (3|, seems to be challenging especially for high rates 
^ > |. Recent works by Cadambe et al. [11] and Suh 
et al. 1 12] used the symbol extension IA technique of 
Cadambe et al. |4] to establish the existence, for all n, k, of 
asymptotically optimal MDS storage codes, that come ar- 
bitrarily close to the theoretic minimum repair bandwidth. 
However, these asymptotic schemes are impractical due 
to the arbitrarily large file size and field size that they 
require. Explicit and practical designs for optimal MDS 
storage codes are constructed roug hly for rates | < \ 151- 
fTQ) , (13), and most of them are based upon the concept of 
interference alignment. Interestingly, as of now no explicit 
MDS storage code constructions exist with optimal repair 
properties for the high data rate regime Q 

Our Contribution: In this work we introduce a new 
high-rate, explicit, (fc+2, fc) storage code over GF(3). Our 
storage code exploits fundamental properties of Hadamard 
designs and perfect IA instances pronounced by the use 
of a lattice representation for the symbol extension IA of 
Cadambe et al. |4|. This representation gives hints for 
coding structures that allow exact instead of asymptotic 
alignment. Our code exploits these structures and achieves 
perfect IA without requiring the file size or field size to 
scale to infinity. Any single systematic node failure can be 
repaired with bandwidth matching the theoretic minimum 
and any single parity node failure generates (at most) 
the same repair bandwidth as any systematic node repair. 
Our code has two parities but cannot tolerate any two 
failures: the form presented here can tolerate any single 
failure and any pair of failures that involves at most one 



during the submission of this manuscript, two independent works 
appeared that constructed MDS codes of arbitrary rate that can optimally 
repair their systematic nodes, see [14], [131. 
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Fig. 1. A (A; + 2, k) Coded Storage Array. 
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Fig. 2. Repair of a (4, 2) code. 



systematic node failur^] Here, in contrast to MDS codes, 
slightly more than k, that is, k (l + encoded pieces 
are required to reconstruct the file object. 

II. Distributed Storage Codes with 2 Parity 
Nodes 

In this section, we consider the code repair problem 
for storage codes with 2 parity nodes. Let a file of size 
M = kN denoted by the vector f G ¥ kN be partitioned 
in k parts f = [f^ . . . f J] , each of size Njn We wish 
to store this file with rate across k systematic and 2 
parity storage units each having storage capacity ¥- = N. 
To achieve this level of redundancy, the file is encoded 
using a (k + 2, k) distributed storage code. The structure 
of the storage array is given in Fig. 1, where A^ and are 
N x N matrices of coding coefficients used by the parity 
nodes a and b, respectively, to "mix" the contents of the 
zth file piece f^. Observe that the code is in systematic 
form: k nodes store the k parts of the file and each of the 
2 parity nodes stores a linear combination of the k file 
pieces. 

To maintain the same level of redundancy when a node 
fails or leaves the system, the code repair process has to 
take place to exactly restore the lost data in a newcomer 
storage component. Let for example a systematic node 
i G {1, . . . , k} fail. Then, a newcomer joins the storage 
network, connects to the remaining fc + 1 nodes, and has to 
download sufficient data to reconstruct . Observe that the 
missing piece exists as a term of a linear combination 
only at each parity node, as seen in Fig. 1. To regenerate 
it, the newcomer has to download from the parity nodes at 
least the size of what was lost, i.e., TV linearly independent 
data elements. The downloaded contents from the parity 
nodes can be represented as a stack of TV equations 
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2 Our latest work expands Hadamard designs to construct 2-parity MDS 
codes that can optimally repair any systematic or parity node failure 
and m-parity MDS codes that can optimally repair any systematic node 
failure [16]. 

3 F denotes the finite field over which all operations are performed. 



where p\ a \ GF? are the equations downloaded from 
parity nodes a and b respectively. Here, V- a \V-^ G 
¥ N x "2" denote the repair matrices used to mix the parity 
contents n Retrieving from jn]) is equivalent to solving 
an underaetermined set of A/equations in the kN un- 
knowns of f , with respect to only the TV desired unknowns 
of . However, this is not possible due to the additive in- 
terference components that corrupt the desired information 
in the received equations. These terms are generated by the 
undesired unknowns fj, j ^ i, as noted in dnb. Additional 

stem 



data need to be downloaded from the systematic nodes, 
which will "replicate" the interference terms and will 
be subtracted from the downloaded equations. To erase 
a single interference term, a download of a basis of 
equations that generates the corresponding interference 

~(A.V< a >)^ 



term, say 
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fj, suffices. Eventually, when all 



undesired terms are subtracted, a 
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f ? has to be 



formed. Thus, it can be proven that the repair bandwidth 
to exactly regenerate systematic node i is given by 

k 

7l = iV+ fi rank([A,V< a) B J vf ) ]), 

where the sum rank term is the aggregate of interference 
dimensions. Interference alignment plays a key role since 
the lower the interference dimensions are, the less repair 
data need to be downloaded. We would like to note that 
the theoretical minimum repair bandwidth of any node for 
optimal (k + 2, k) MDS codes is exactly (k + l)y, i.e. 
half of the remaining contents; this corresponds to each 
interference spaces having rank y. This is also true for 
the systematic parts of non-MDS codes, as long as they 
have the same problem parameters that were discussed in 
the beginning of this section, and all the coding matrices 
have full rank N. An abstract example of a code repair 
instance for a (4, 2) storage code is given in Fig. 2, where 
interference terms are marked in red. 

To minimize the repair bandwidth 7^ we need to care- 
fully design both the storage code and the repair matrices. 

4 Here, we consider that the newcomer downloads the same amount of 
information from both parities. In general this does not need to be the 



In the following, we provide a 2-parity code that achieves 
optimal systematic and near optimal parity repair. 

III. A New Storage Code 

We introduce a (k + 2, k) storage storage code over 
GF(3), for file sizes M = k2 k , with coding matrices 



*-N i 



(2) 



where N 2 k , I 2 <-i ® blkdiag ^1*, -Iivj, and 

i G {1, . . . , k}. In Fig. 3, we give the coding matrices of 
the (5,3) version of the code. 

Theorem 1: The code in ^ has optimally repairable 
systematic nodes and its parity nodes can be repaired by 
generating as much repair bandwidth as a systematic repair 
does. It can tolerate any single node failure, and any pair 
of failures that contains at most one systematic failure. 
Moreover, to reconstruct the file at most fc+^ coded blocks 
are required. 

In the following, we present the tools that we use in our 
derivations. Then, in Sections V and VI we prove Theorem 
1. 

IV. Dots-on-a-Lattice and Hadamard Designs 

Optimality during a systematic repair, requires inter- 
ference spaces collapsing down to the minimum of y, 
out of the total N, dimensions. At the same time, useful 
data equations have to span N dimensions. For the con- 
structions presented here, we consider that the same repair 
matrix is used by both parities, i.e., V- 1 ^ = V- 2 ^ = V^. 
Hence, for the repair of systematic node i G {1, . . . , k} 
we optimally require 

N 

y 

for all j G {1, . . . , k}\i, and at the same time 
rankQV, X,V,]) = TV. 



rankQV, X,V,]) 
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(4) 



The key ingredient of our approach that eventually pro- 
vides the above is Hadamard matrices. 

To motivate our construction, we start by briefly dis- 
cussing the repair properties of the asymptotic coding 
schemes of (TTJ, |T2| . Consider a 2-parity MDS storage 
code that requiresnle sizes M = k2A k_1 , i.e., N = 
2A k ~ 1 . Its N x N diagonal coding matrices {X s }^ =1 
have i.i.d. elements drawn uniformly at random from some 
arbitrarily large finite field F. During the repair of a 
systematic node i G {1, . . . , k}, the repair matrix that 
is used by both parity nodes to mix their contents, has as 
columns the ^ = A fc_1 elements of the set 



Vi=l n X: s w:x s G{0,...,A-l} 



(5) 



Then, we define a map C from vectors in the set 
|ris=i XJ s w : x s G Z j to points on the integer lattice 
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Fig. 4. Here we have k = 3, f = 4, and A = 2. 
Moreover, £(V 3 ) = {(0, 0, 0), (0, 1, 0), (1, 0, 0), (1, 1, 0)}, 
£(XiV 3 ) = {(1,0,0), (1,1,0), (2, 0,0), (2, 1,0)}, and 
£(X 2 V 3 ) = {(0, 1, 0), (0, 2, 0), (1, 1, 0), (1, 2, 0)}. 



7L k \ Yl s =i-^s sw ~~ ^ *}2s=i x s e s> where e s is the s- 
th column of Ifc+i. Now, consider the induced lattice 
representation of V* 



£( V *)=< J2 *-e a ; x, e{0,...,A-l} L (6) 

^s=l,s^i J 

Observe that the i-th dimension of the lattice where C(Vi) 
lies on, indicates all possible exponents X{ of X$. Then, 
the products XjV$, j ^ i, and X^V^ map to 

C(XjVi) = <(xj + l)e,+ ^x s e s ; x s G {0, . . . , A - 1} 

k 



and 



£(X i V i ) = je i + ^e,; x s G {0,...,A-1}|, 



respectively. In Fig. 2, we give an illustrative example for 
k = 3, and A = 2. 

Remark 1: Observe how matrix multiplication of X^ 
and elements of V z manifests itself through the dots-on-a- 
lattice representation: the product of X^ with the elements 
of Vi shifts the corresponding arrangement of dots along 
the x^-axis, i.e., the x z -coordinate of the initial points gets 
increased by one. 

Asymptotically optimal repair of node i is possible due 
to the fact that interference spaces asymptotically align 

rankQV, X.VJ) _ |£(V<) U CQLjVJl 



N 
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^(VOI+cKA*- 1 ) 



(7) 



and useful spaces span N dimensions, that is, 
rank([V i XiV*]) = |£(Vi) U £(XiV<)| = 2A k ~\ 
with arbitrarily high probability for sufficiently large field 



sizes. 



The question that we answer here is the following: How 
can we design the coding and the repair matrices such that 
i) exact interference alignment is possible and ii) the full 
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Fig. 3. The coding matrices of a repair optimal (5, 3) code over GF(3). 



rank property is satisfied, for fixed in k file size and field 
size? We first address the first part. We want to design the 
code such that the space of the repair matrix is invariant 
to any transformation by matrices generating its columns, 
i.e., £(XjVi) = C(Vi). This is possible when 

CQtjVi) = {{xj + l)ej+J2x s e s - x s € {0, . . . , A - 1} 

= bjej+^xses; x s € {0,...,A- 1}1 = £(V;), 

I s=l,s^j J 

that is, when the matrix powers "wrap around" upon 
reaching their modulus A. This wrap-around property is 
obtained when the diagonal coding matrices have elements 
that are roots of unity. 

Lemma 1: For diagonal matrices, Xi, . . . , whose 
elements are A-th roots of unity, i.e., X^ = X^, for all 
s G {1, . . . , k}, we have that C(Xj V*) = £(V;), for all 
i G {l,...,/c}\j. 

However, arbitrary diagonal matrices whose elements 
are roots of unity are not sufficient to ensure the full rank 
property of the useful data repair space [V^ X^Vi]. In 
the following we prove that the full rank property along 
with perfect I A is guaranteed when we set N = 2 k , X$ = 
I 2 <-i 0blkdiag (ir^ , — Ijv^, and consider the set 
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IJxf*w:^e{o,i}L 



(8) 



Interestingly, there is a one-to-one correspondence be- 
tween the elements of Hat and the columns of a Hadamard 
matrix. 

Lemma 2: Let an N x N Hadamard matrix of the 
Sylvester's construction 
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(9) 



with Hi = 1. Then, is full-rank with mutually 

orthogonal columns, that are the N elements of Hn- 
Moreover, any two columns of H^r 



differ in y positions. 



The proof is omitted due to lack of space. To illustrate the 
connection between Hn and we "decompose" the 
Hadamard matrix of order 4 
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Fig. 5. The coding matrices of our (6, 4) code are given. We illustrate 
the "absorbing" properties of the repair matrix for systematic node 1. The 
column space of the repair matrices is invariant to the corresponding blue 
blocks. This results in interference spaces aligning in exactly half of the 
dimensions available. 



where Xi 




and X 2 = diag 



Due to 



the commutativity of Xi and X 2 , the columns of H 4 are 
also the elements of H4 = {w, Xiw, X 2 w, X1X2W}. 

By using Hat as our "base" set, we are able to ob- 
tain perfect alignment condition due to the wrap around 
property of it elements; the full rank condition will be also 
satisfied due to the mutual orthogonality of these elements. 

V. Repairing Single Node Failures 

A. Systematic Repairs 

Let systematic node i G {1, . . . , k} fail. Then, we pick 
the columns of the repair matrix as a set of y vectors 
whose lattice representation is invariant to all X^s but to 
one key matrix X^. We specifically construct the TV x 
Y repair matrix whose columns have a one-to-one 
correspondence with the elements of the set 



K I [I Xf*w:z s G{0,l} 



(ID 



First, observe that is full column rank since it is a 
collection of y distinct columns from Hn- Then, we have 
the following lemma. 

Lemma 3: For any z, j G {1, 2, ... , k}, we have that 



rankQV, X, V,]) = |£(V,) U C (X, V*)| 

i =j 



AT, i = j 
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(12) 



The above holds due to each element of Hn being 
associated with a unique power tuple. Then, the columns 
of [Vi X^Vi] are exactly the elements of Hat, since 



£(v,)u£(x,v,) = < xie{0,l} \ 

I s — 1 ,s^i 

( k 1 (13) 

= £(Hjv). 

Moreover, the set of columns in are identical to the set 
of columns of XjV^, i.e., C(Vi) = £(XjV^), for j ^ i, 
due to Lemmata 1 and 2. Therefore, the interference spaces 
span y dimensions, which is the theoretic minimum, and 
the desired data space during any systematic node repair 
is full-rank, since it has as columns all columns of H^. 

Hence, we conclude that a single systematic node of the 
code can be repaired with bandwidth (fc + l)y = ^-M. 
In Fig. 4, we depict a (6, 4) code of our construction, along 
with the illustration of the repair spaces. 

B. Parity repairs 

Here, we prove that a single parity node repair gener- 
ates at most the repair bandwidth of a single systematic 
repair. Let parity node a fail. Then, observe that if the 
newcomer uses the N x N repair matrix = Xi to 
multiply the contents of parity node b, then it downloads 

x i (Ei=i x i f i) = f i +£i=2 x i x i f i- Observe, that the 
component corresponding to systematic part fi appears the 
same in the linear combination stored at the lost parity. By 
Lemma 2, each of the remaining blocks, XiX^ share 
exactly y indices with equal elements to the same y 
indices of X^fj which was lost, for any i G {2, . . . , k}. 
This is due to the fact that the diagonal elements of 
matrices XiX^ and X^ are the elements of some two 
columns of H^. Therefore, the newcomer has to download 



connect to nodes {1, — 2} and both parities: 



from systematic node j G {2, . . . , k}, the y entries 
that parity a's component Xjfj differs from the term 
XiXjfj of the downloaded linear combination. Hence, 
the first parity can be repaired with bandwidth at most 
N + (k - l)f = (k + l)f j^The repair of parity node b 
can be performed in the same manner. 



VI. Erasure Resiliency 

Our code can tolerate any single node failure and any 
two failures with at most one of them being a systematic 
one. A double systematic and parity node failure can be 
treated by first reconstructing the lost systematic node 
from the remaining parity, and then reconstructing the 
lost parity from all the systematic nodes. However, two 
simultaneous systematic node failures cannot be tolerated. 
Consider for example the corresponding matrix when we 

5 By "at most" we mean that this result is proved using an achievable 
scheme, however, we do not prove that it is optimal. 
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having rank For these 



cases, an extra download of y equations is required to 
decode the file, i.e., an aggregate download of kN + y 
equations, or k + \ encoded pieces. 
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